I claim: 



1 . A method for determining a word entered using a reduced keypad, where each of one 
or more keys of the reduced keypad is mapped to a plurality of letters, the method 
comprising: 

5 receiving key input corresponding to the word, the key input having at least one of a 
left context and a right context; and, 

determining the word corresponding to the key input by using a machine learning 
approach with a language model based on one or more of the at least one of the left 
context and the right context of the key input. 

10 2. The method of claim 1 , wherein the reduced keypad is a numeric keypad. 

3. The method of claim 1, wherein the key input has at least the left context, and the 
word corresponding to the key input is determined by using the machine learning 
approach based on the left context of the key input. 

4. The method of claim 1, wherein the key input has at least the right context, and the 
15 word corresponding to the key input is determined by using the machine learning 

approach based on the right context of the key input. 

5. The method of claim 1, wherein the key input has both the left context and the right 
context, and the word corresponding to the key input is determined by using the machine 
learning approach based both on the left context and the right context of the key input. 
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6. The method of claim 1, wherein using the language model comprises using an n-gram 
model 

7. The method of claim 6, wherein using the n-gram model comprises using a bigram 
model. 

5 8 . The method of claim 1 , wherein using the machine learning approach further 
comprises using a cache model in addition to the language model. 

9. The method of claim 1, wherein the language model comprises a compressed 
language model. 

10. The method of claim 1, wherein the key input has at least the left context, the word 
10 corresponding to the key input determined by using the machine learning approach based 

on the left context of the key input, and using the machine learning approach comprises 
using a bigram model as the language model, comprising: 

for each word in a vocabulary that is consistent with the key input, determining a 
probability of the word given the left context, and adding the word and the probability of 
15 the word to an array of word-probability pairs; and, 

sorting the array of word-probability pairs in decreasing order of probability. 

11. The method of claim 10, wherein using the machine learning approach comprises 
using a cache model in addition to using the bigram model, such that the probability of 
the word is determined given the left context and a cache. 
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12. The method of claim 10, wherein using the bigram model further comprises: 

for each word in the vocabulary that is consistent with the key input as an initial part 
of the word, determining a probability of the word given the left context, and, upon 
determining that the probability is greater than a greatest probability so far determined, 
5 setting the greatest probability to the probability and a greatest probability word 
associated with the greatest probability to the word; 

upon determining that the greatest probability is at least a number of times greater 
than a word of a first word-probability pair of the array of word-probability pairs, 
inserting the greatest probability word associated with the greatest probability and the 
1 0 greatest probability as a new first word-probability pair before the first word-probability 
pair within the array. 

13. The method of claim 12, wherein using the machine learning approach comprises 
using a cache model in addition to using the bigram model, such that the probability of 
the word is determined given the left context and a cache. 

15 14. The method of claim 1 , wherein the key input has both the left context and the right 
context and has a plurality of number sequences where each sequence corresponds to a 
word, the plurality of words corresponding to the key input determined by using the 
machine learning approach based both on the left context and the right context of the key 
input. 

20 15. The method of claim 1, wherein the language model comprises a compressed 

language model, the compressed language model compressed by performing a method 
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comprising: 

smoothing an uncompressed language model; and, 

pruning the uncompressed language model to yield the compressed language model 

16. The method of claim 15, wherein pruning the uncompressed language model 

5 comprises using one of: a count-cutoffs approach, a Rosenfeld pruning approach, and a 
Stolcke pruning approach 

17. The method of claim 15, wherein pruning the uncompressed language model 
comprises determining a normalization factor for each word in the uncompressed model 
only prior to pruning. 

10 18. The method of claim 1 5, wherein pruning the uncompressed language model accounts 
for ambiguous words in the uncompressed model 

19. The method of claim 15, wherein pruning the uncompressed language model accounts 
for an effect of the pruning on key input accuracy. 

20. The method of claim 1, wherein the method is performed by execution of a computer 
1 5 program by a processor from a computer-readable medium. 

21. A computer-readable medium having instructions stored thereon for execution by a 
processor to perform a method for determining a word entered using a reduced keypad, 
where each of one or more input keys of the reduced keypad is mapped to a plurality of 
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letters, the method comprising: 

receiving key input corresponding to the word, the key input having a left context; 

for each word in a vocabulary that is consistent with the key input, determining a 
probability of the word given the left context, and adding the word and the probability of 
5 the word to an array of word-probability pairs; 

determining the word corresponding to the key input as a word of a word-probability 
pair within the array of word-probability pairs having a greatest probability. 

22. The medium of claim 21, wherein the reduced keypad is a numeric keypad. 

23. The medium of claim 21, wherein determining the word corresponding to the key 
10 input comprises: 

sorting the array of word-probability pairs in decreasing order of probability; and, 
determining the word corresponding to the key input as a word of a first word- 
probability pair within the array of word-probability pairs. 

24. The medium of claim 21, the method further initially comprising, for each word in a 
15 cache that is consistent with the key input, determining a probability of the word given 

the left context, and adding the word and the probability of the word to an array of word- 
probability pairs. 

25. The medium of claim 21, the method further comprising prior to determining the 
word corresponding to the key input: 

20 for each word in the vocabulary that is consistent with the key input as an initial part 
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of the word, determining a probability of the word given the left context, and, upon 
determining that the probability is greater than a greatest probability so far determined, 
setting the greatest probability to the probability and a greatest probability word 
associated with the greatest probability to the word; 
5 upon determining that the greatest probability is significantly more likely than a word 
of a first word-probability pair of the array of word probability-pairs, adding the greatest 
probability word associated with the greatest probability and the greatest probability as a 
new first word-probability pair to the array. 

26. The medium of claim 25, the method further initially comprising prior to determining 
1 0 the word corresponding to the key input, for each word in a cache that is consistent with 
the key input as an initial part of the word, determining a probability of the word given 
the left context, and, upon determining that the probability is greater than the greatest 
probability so far determined, setting the greatest probability to the probability and a 
greatest probability word associated with the greatest probability to the word. 

15 27. A method for determining a word entered using a reduced keypad, where each of one 
or more keys of the reduced keypad is mapped to a plurality of letters, the method 
comprising: 

receiving key input corresponding to the word, the key input having at least one of a 
left context and a right context; 
20 determining the word corresponding to the key input by using a compressed language 

model based on one or more of the at least one of the left context and the right context of 
the key input, 
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wherein the compressed language model is compressed by performing a method 
comprising: 

smoothing an uncompressed language model; and, 

pruning the uncompressed language model to yield the compressed language model. 
5 28. The method of claim 27, wherein the reduced keypad is a numeric keypad. 

29. The method of claim 27, wherein pruning the uncompressed language model 
comprises using one of: a count-cutoffs approach, a Rosenfeld pruning approach, and a 
Stolcke pruning approach 

30. The method of claim 27, wherein pruning the uncompressed language model 

1 0 comprises determining a normalization factor for each word in the uncompressed model 
only prior to pruning. 

31. The method of claim 27, wherein pruning the uncompressed language model accounts 
for ambiguous words in the uncompressed model. 

32. The method of claim 27, wherein pruning the uncompressed language model accounts 
15 for an effect of the pruning on key input accuracy. 

33. The method of claim 27, wherein the method is performed by execution of a 
computer program by a processor from a computer-readable medium. 
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34. An apparatus comprising: 

a plurality of keys, each of one or more of the keys mapped to a plurality of letters, 
the plurality of keys used to enter key input corresponding to a word, the key input 
having at least one of a left context and a right context; and, 
5 a word-determining logic designed to determine the word corresponding to the key 

input by using a machine learning approach with a language model based on one or more 
of the at least one of the left context and the right context of the key input. 

35. The apparatus of claim 34, further comprising a display on which the at least one of 
the left context and the right context, and the word corresponding to the key input, are 

10 displayed. 

36. The apparatus of claim 34, wherein the apparatus is a telephone. 

37. The apparatus of claim 36, wherein the apparatus is a mobile telephone. 

38. The apparatus of claim 36, wherein the apparatus is one of: a cellular telephone, a 
corded telephone, a cordless telephone, a digital telephone, and a radio telephone. 

15 39. The apparatus of claim 34, wherein the apparatus is one of: a pager, a desktop 
computer, a laptop computer, a handheld device, a personal-digital assistance (PDA) 
device, and a remote control device. 
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40. The apparatus of claim 34, wherein the word-determining logic comprises a computer 
program stored on a computer-readable medium for execution by a processor. 



41. The apparatus of claim 34, wherein the key input has at least the left context, and the 
word corresponding to the key input is determined by the word-determining logic by 

5 using the machine learning approach based on the left context of the key input. 

42. The apparatus of claim 34, wherein the key input has at least the right context, and the 
word corresponding to the key input is determined by the word-determining logic by 
using the machine learning approach based on the right context of the key input. 

43. The apparatus of claim 34, wherein the key input has both the left context and the 
10 right context, and the word corresponding to the key input is determined by the word- 
determining logic by using the machine learning approach based both on the left context 
and the right context of the key input. 

44. The apparatus of claim 34, wherein the word-determining logic uses a cache model. 
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